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© Cache device for supplying a fixed word length of a variable ie igth instruction code and 
instruction fetch device. 



© A cache device supplies a given word length of 
a variable-length instruction code and an instruction 
fetch device. The cache device realizes an instruc- 
tion cache (3) capable of supplying only the desired 
word length of an instruction code designated by an 
arbitrary address irrespective of the instruction align- 
ment. The cache device also realizes a branch his- 
tory table (4) capable of reading the branch destina- 
ticn^adcxsss_pf the branch instruction which is posi- 
CO tioned. nearest to the read address in the word length 
<C supplieo by the instruction cache with the read ad- 
dress as the origin. The instruction fetch device is 
<j5 capable of always reading an instruction code of the 
00 desired word length and performing only the predic- 
^ Uon of a branch of that branch instruction which is to 
be executed first irrespective of the position and 
CO flumcer of branch instructions included in the in- 
q siruciion code read. The cache device realizing the 
instruction cache selects one of the entries whose 
qj stored address or pan [hereof coincides with ina 
read address and at least one another entry holding 
•the storage contents contiguous to the storage con- 



tents storeo in the first-mentioned entry and it in- 
cludes storage contents selecting means (312-315) 
so that the storage contents stored in the selected at 
least one or more entries are selected by every 
storage unit of the storage contents in correspon- 
dence to the read address. The cache device realiz- 
ing the branch history table selects one of the en- 
tries whose stored identifying information coincides 
with that identifying information which identifies the 
subject storage contents of a reading operation and 
at least one another entry falling within a giver, range 
determined by the stored tdentliying information and 
it includes storage contents selecting means (322) to 
select from the selected at feast one or more entries 
the one whose stored identifying information differs 
in a minimum amount from the identifying informa- 
tion for identifying the subject storage contents of 
the reading operation and the storage contents 
scored in this entry are outputted. Also. :he instruc- 
tion tetcn device includes ins;ruc:«cn storage means 
(3) formed by the cache cevice realizing :ne mstruc* 
•ion cacne anc the mstrucnon storage means »s 
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searched by the value of address holding means (1) 
to output an instruction string of a predetermined 
word length stored therein. At the same time, if the 
outputted instruction string includes no branch in- 
struction whose branch has been taken in the past, 
address incrementing means (5) generates an ad- 
dress of an instruction string which is to be outpa- 
ced next from the instruction storage means, where- 
as if the outputted instruction string includes any 



branch instruction whose branch has been taken 'in 
the pas;, branch predicting means (4) formec by the 
cache cevice realizing the branch history table is 
searched by the value of the read address holding ' 
. means thereby outoutting the branch destination ad- 
dress c? ihe branch instruction which is nearest to 
the reac address and included in the instruction 
string read from the instruction storage means. 
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© Cache device for supplying a fixed word length of a variable length instruction code and 
instruction fetch device. 



© A cache device supplies a given word length of 
a variable-length instruction code and an instruction 
fetch device. The cache device realizes an instruc- 
tion cache (3) capable of supplying only the desired 
word length of an instruction code designated by an 
arbitrary address irrespective of the instruction align- 
ment. The cache device also realizes a branch his- 
tory table (4) capable of reading the branch destina- 
tion address of the branch instruction which is posi- 
CSJ tioned nearest to the read address in the word length 
^supplied by the instruction cache with the read ad- 
dress as the origin. The instruction fetch device is 
CO capable of always reading an instruction code of the 
GO desired word length and performing only the predic- 
CNJ tion of a branch of that branch instruction which is to 
P^be executed first irrespective of the position and 
^number of branch instructions included in the in- 
©struction code read. The cache device realizing the 
n instruction cache selects one of the entries whose 
t f i stored address or pan thereof coincides with the 
read address and at least one another entry holding 
the storage contents contiguous to the storage con- 



tents stored in the first-mentioned entr/ and it in- 
cludes storage contents selecting means (312-315) 
so that the storage contents stored in the selected at 
least one or more entries are selected by every 
storage unit of the storage contents in correspon- 
dence to the read address. The cache device realiz- 
ing the branch history table selects one of the en- 
tries whose stored identifying information coincides 
with that identifying information which identifies the 
subject storage contents of a reading operation and 
at least one another entry falling within a given range 
determined by the stored identifying information and 
it includes storage contents selecting means (322) to 
select from the selected at least one or more entries 
the one whose stored identifying information differs 
in a minimum amount from the identifying informa- 
tion for identifying the subject storag contents of 
the reading operation and the storage contents 
stored in this entry are outputted. Also, the instruc- 
tion fetch device includes instruction storage means 
(3) formed by the cache device realizing the instruc- 
tion cache and the instruction storage means is 
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searched by the value of address holding means (1) 
to output an instruction string of a predetermined 
word length stored therein. At the same time, if the 
outputted instruction string includes no branch in- 
struction whose branch has been taken in the past, 
address incrementing means (5) generates an ad- 
dress of an instruction string which is to be output- 
ted next from the instruction storage means, where- 
as if the outputted instruction string includes any 
branch instruction whose branch has been taken in 
the past, branch predicting means (4) formed by the 
cache device realizing the branch history table is 
searched by the value of the read address holding 
means thereby outputting the branch destination ad- 
dress of the branch instruction which is nearest to 
the read address and included in the instruction 
string read from the instruction storage means. 
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CACHE DEVICE FOR SUPPLYING A FIXED WORD LENGTH OF A VARIABLE INSTRUCTION CODE AND 

INSTRUCTION FETCH DEVICE 



BACKGROUND OF THE fNVENTION 

The present invention relates to a central pro- 
cessing unit of a computer and more particularly to 
a cache device and an instruction, fetch device for 
supplying a fixed word length of a variable-length 
instruction code and predicting the branch destina- 
tion of a branch instruction. 

Fig. 8 shows the construction of a conventional 
instruction fetch device employing a cache device 
for performing a branch destination prediction. In 
the Figure, numeral 7 designates a 29-bit read 
pointer for holding the leading address of an in- 
struction code to be read, 8 an instruction cache, 
and 9 a branch history table for storing as a set the 
address of each branch instruction taken and the 
address of the branch* destination. The address of 
an instruction code to be read is stored in the read 
pointer 7 and the instruction cache 8 and the 
branch history table 9 are searched. When the 
instruction code read out from the instruction cache 
8 includes any branch instruction taken in the past, 
it is expected that its branch destination address is 
also read from the branch history table 9. In this 
case, this branch destination address is read and 
stored in the read pointer 7 and the instruction 
cache 8 is searched thereby reading the instruction 
code at the branch destination without waiting until 
the decoding and execution of the branch instruc- 
tion. 

However, where branch instructions BccO and 
Bed are respectively present at the positions of 
addresses n and n + 4 as shown in the instruction 
alignment diagram in Fig. 9 A, after the high-order 
29 bits of the address n have been stored in the 
read pointer 7, even if the branch history table 9 is 
searched by the high-order 29 bits of the address n 
so as to obtain the branch destination address of 
the branch instruction BccO or Bed, both of the 
two entries storing the branch destination address- 
es hit and thus the desired branch destination 
address cannot be obtained. Also, where an in- 
struction OP with a 32-bit displacement operand 
disp 32 is present at the position of an address 
n + 6 as shown in the instruction alignment diagram 
in Fig. 9B, in predicting a branch instruction for 
branching; to the address n + 6 from a certain ad- 
dress, even if the branch destination address ob- 
tained from the branch history table 9 is the ad- 
dress n + 6. the instruction code read from the 
instruction cache 8 does not include the displace- 
ment operand disp 32 and a wait is required for th 
decoding or the instruction OP until the displace- 
ment operand disp 32 is read out again. 



SUMMARY OF THE INVENTION 

It is an object of the present invention to pro- 
vide a cache device which realizes an instruction 

5 cache capable of supplying only the desired word 
length of an instruction code designated by any 
given read address irrespective of the instruction 
alignment and which further realizes a branch his- 
tory table capable of reading the branch destination 

w address of a branch instruction positioned nearest 
to the read address in the word length supplied by 
an instruction cache with the read address as the 
origin, and an instruction fetch device capable of 
always reading an instruction code of the desired 

is word length from an instruction cache and also 
capable of effecting only the prediction of branch- 
ing of the branch instruction to be executed first 
even if the branch instruction is present at any 
position in the instruction code read or a plurality of 

20 branch instructions are included in the instruction 
code7~~ 

To accomplish the above object, in accordance 
with the present invention there is thus provided a 
cache device which realizes an 'instruction cache 

25 such that any entry whose stored address or a part 
thereof coincides with the read address and at 
least one another entry holding the stored contents 
contiguous to the stored contents of the said entry 
are selected and storage contents selecting means 

30 is provided to read the storage contents stored in 
the selected at least one or more entries and 
thereby to select the storage contents by every 
storage unit in correspondence to the read ad- 
dress. 

3s In accordance with another aspect of the 

present invention, there is provided a cache device 
which realizes a branch history table such that any 
entry whose stored identifying information coin- 
cides with identifying information identifying the 

40 stored contents subjected to a read operation and 
at least one another entry included in a given range 
defined by the stored identifying information are 
selected, and storage content selecting means is 
provided to select one of the selected at least one 

45 or more entries whose stored identifying informa- 
tion has the least difference from the identifying 
information identifying the storage contents sub- 
jected to the read operation and the storage con- 
tents stored in this entry are outputted. 

50 In accordance with still another aspect of the 

present invention, there is provided an instruction 
fetch device so designed that instruction storage 
means including such cache device realizing an 
instruction cache is searched in accordance with 



3 



3 



EP 0 372 865 A2 



ihe value of read address holding means and its 
stored instruction string of a predetermined word 
length is outputted. Simultaneously, if the outputted 
instruction string includes no branch instruction 
whose branch has been taken in the past, address 
incrementing means generates an address of an 
instruction string which is to be outputted next by 
the instruction storage means, whereas if the out- 
putted instruction string includes any branch in- 
struction whose branch has been taken in the past, 
branch predicting means including the cache de- 
vice realizing the branch history table is searched 
in accordance with the value of the read address 
holding means thereby outputting the branch des- 
tination address of the branch instruction which is 
nearest to the read address and which is included 
in the instruction string read from the instruction 
storage means. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing a cache 
device for supplying a fixed word length of a 
variable-length instruction code and an instruction 
fetch device according to a first embodiment of the 
present invention. 

Fig. 2 is a detailed block diagram of the 
instruction cache 3 in the first embodiment. 

Fig. 3 is a detailed block diagram of the 
branch history table 4 in the first embodiment. 

Fig. 4A is a detailed block diagram of the 
read entry preference circuit 409 of the branch 
historytable 4 in the first embodiment. 

Fig. 4B shows the relation between the in- 
puts A and 8 and the outputs OUT and CRY. 

Figs. 5A to 5E show instruction alignments 
useful for explaining the operation of the instruction 
cache 3 in the first and second embodiments, 
respectively. 

Fig. 6 shows instruction alignments useful for 
explaining the operation of the branch history table 
4 in the first embodiment. 

Fig. 7 is a detailed block diagram showing 
the cache device for supplying a fixed word length 
of a variable-length instruction code and the in- 
struction fetch device in the second embodiment. 

Fig. 8 is a block diagram showing a conven- 
tional instruction fetch device employing a cache 
device for performing a branch destination predic- 
tion. 

Figs. 9A and 98 show instructions align- 
ments useful for explaining the operation of a 
branch history table 9 in a conventional instruction 
fetch device. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 



Fig. 1 is a block diagram showing a cache 
device for supplying a fixed word length cf a 
variable-length instruction code and an instruction 
fetch device according to a first embodiment or the 
5 present invention. In the cache device and ihe 
instruction fetch device according to the first em- 
bodiment, the word length per instruction is an 
integral multiple of 16 bits (hereinafter referred to 
as a half word) and the instruction code is aligned 
io on the half-word boundary (the lowest-order bit of 
the address is 0) of a 32-bit address space. Also, 
the address is assigned to each 8' bits and the 
instruction code is identified by the address as- 
signed to its leading 8 bits. In Fig. 1 t numeral 1 

75 designates a read pointer for holding the high-order 
29 bits of the leading address of a 4-half-word 
instruction code to be read, 2 a half-word identify- 
ing register for holding the 2 bits following the 
high-order 29 bits of the leading address of the 4- 

20 half-word instruction code to be read which is held 
in the read pointer 1 (the 2 bits following the high- 
order 29 bits of the address is hereinafter reierred 
to as a half word identifying 2-bits), 3 an instruction 
cache for storing as a set each instruction code 

25 aligned on the 4-half-word boundary (the low-order 
3 bits of the address are Os) and its leading ad- 
dress and selecting the instruction code stored in 
the entry searched by the value of the read pointer 
1 and the value of the half-word identifying register 

30 2 to output the 4-half-word instruction code, 4 a 
branch history table for storing as a pair each 
address of a branch instruction whose branch has 
been, taken and its branch destination address 
whereby the entry searched according to the value 

as of the read pointer 1 and the value of the half-word 
identifying register 2 is selected to read the stored 
branch destination address, 5 a pointer incrementer 
for adding a 1 to the high-order 29 bits of the read 
address held in the read pointer 1 . and 6 a pointer 

40 input selector responsive to the hit signal from the 
branch history table 4 to choose between the out- 
put of the pointer incrementer 5 or the high-order 
29 bits of the read address which is incresased by 
1 and the high-order 29 bits of the branch destina- 

45 tion address obtained from the branch history table 
4. 

Fig. 2 is a detailed block diagram of the in- 
struction cache 3 shown in Fig. 1. In the Figure, 
numeral 301 designates an address tag section 

so including 128 entries each storing the high-order 
22 bits of an address and searched in accordance 
with the value of the read pointer 1, 302 a data 
section including 128 entries each storing a 64-bits 
instruction code aligned on a 4-half-word boundary 

55 whereby the instruction code in the entry searched 
in the address tag section 301 is selected by the 
value of the half-word identifying register 2 to out- 
put the 4-half-word instruction code, 303 a 7-bit 
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selector. 304 an address decoder for decoding the 
high-order 7 bits or the 23rd to 29th bits of the 
address to select 2 entries for reading and writing 
opposes. 305 address latches each for holding the 
high-order 22 bits of an address. 306. 307. 308 and 

309 are data latches for respectively holding the 
highest half word (hereinafter referred to as a first 
half word), the second highest half word 
(hereinafter referred to as a second half word), the 
third highest half word (hereinafter referred to as a 
third half . word) and the lowest half word 
(hereinafter referred to as a fourth half word) of an 
instruction code aligned on a 4-half-word boundary. 

310 address driver each for performing the reading 
and writing of the high-order 22 bits of the address 
with respect to one of the address latches 305 
corresponding to one of the two entries selected by 
the address decoder 304 which is smaller .n ad- 
dress 31 1 address drivers each for performing the 
reading of the high-order 22 bits of the address 
with respect to one of the address latches 305 
corresponding to one of the two entries selected by 
the address decoder 304 which is greater in ad- 
dress 312 data drivers each for performing the 
writing of the first half word of the instruction code 
with respect to one of the data latches 306 cor- 
responding to one of the two entries selected by 
the address decoder 304 which is smaller in ad- 
dress as well as the reading when the value of the 
half-word identifying register 2 is 0 and [or per- 
forming the reading of the first half word o die 
instruction code with respect to the other data latch 
306 corresponding to the other entry wh.ch » 
greater in address when the value of the half-word 
identifying register 2 is greater than 1. 313 data 
driver for each performing the writing of the second 
half word of the instruction code with respect to 
one of the data latches 307 corresponding to one 
of the two entries selected by the address decoder 
304 which is smaller in address as well as the 
reading when the value of the half word identifying 
register 2 is smaller than 1 and for performing the 
reading of the second half word of the instruction 
code with respect to the other data latch 307 
corresponding to the other entry which is greater in 
address when the value of the hart-word identifying 
register 2 is greater than 2. 314 data drivers each 
for performing the writing of the third half word of 
the instruction code with respect to one of the data 
latches 308 corresponding to one of the two entries 
selected by the address decoder 304 as well as 
the reading when the valu of the half-word iden- 
tifying register 2 is less than 2 and for performing 
the reading of th third half-word of the '"strucUon 
code with respect to the other data latch 308 
corresponding to the other entry which is greater .n 
address when the value of the half-word identifying 
register 2 is 3. and 315 data drivers each tor 



performing the writing and reading of the fourth half 
word of the instruction code with respect to one of 
the data latches 309 corresponding to one of the 
two entries selected by the address decoder 304 
s which is smaller in address. The suffixes i and j of 
the numerals 305 to 315 indicate respectively the 
ith and jth entries. Numeral 316 designates a first 
address tag comparator for detecting the presence 
of equality between the value of the address latch 
io 305 corresponding to one of the two entries se- 
lected by the address decoder 304 which is small- 
er in address and the high-order 22 bits of the read 
address held in the read pointer 1. 317 a second 
address tag comparator for detecting the presence ' 
, 5 of equality between the value of the address latch 
305 corresponding to one of the two entries se- 
lected by the address decoder 304 which is greater 
in address and the high-order 22 bits of the read 
address held in the read pointer 1. and 31 8 an 
20 AND circuit for producing the logical product of the 
outputs from the first and second address tag 
comparators 316 and 317 to generate a signal 
indicating that the instruction codes in the data, 
sections of the two entries selected by the address 
25 decoder 304 are contiguous. 

Referring now to Fig. 3. there is illustrated a 
detailed block diagram of the branch history table 4 
shown in Fig. 1. In the Figure, numeral 401 des- 
ignates an address tag section for storing the h.gh- 
30 order 29 bits of the address of each branch instruc- 
tion so as to be searched by the value of the read 
pointer 1 , 402 a data section for storing the half- 
- word identifying 2 bits of each branch instruction 
address and the high-order 31 bits of each branch 
* 35 destination address so that the entry searched in 
the address tag section 401 is selected in accor- 
dance with the half-word identifying 2 bits of the 
branch instruction address and the value of the 
half-word identifying register 2. thereby outputting 
.o the branch destination address, ^ address 
latches each for holding the high-order 29 bits of a 
branch instruction address. 404 pre-address 
latches each for holding the value obtained by 
subtracting 1 from the high-order 29 bits ; of * 
,s branch instruction address. 405 "ranch position 
identifying latches each for holding the half-word 
identifying 2 bits of -a branch instruction address 
406 data latches each for holding the high-order 31 
bits of a branch destination address. 407 first ad- 
50 dress tag comparators each *J**<#* £ 
equality between the value of the address latch 403 
end the read address held in the read pointer 1. 
408 second address tag comparators each for de- 
tecting the equality between the value of the pre- 
c 5 address latch 404 and the read address held m the 
" read pointer 1. 409 read entry preference circuits 
each so designed that of the entry where the value 
of the half-word identifying register 2 is equal to or 
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greater than the value of the branch position iden- 
tifying latch 405 with respect to the entries where 
the value of the address latch 403 detected by the 
first address tag comparator 407 is coincident with 
the read address and the entry where the value of 
the half-word identifying register 2 is smaller than 
the value of the branch position identifying latch 
405 with respect to the entries where the value of 
the pre-address latch 404 detected by the second 
address tag comparator 408 is coincident with the 
read address, the entry where the difference be- 
tween the value of the branch position identifying 
latch 405 and the value of the half-word identifying 
register 2 (the 2-bit unsigned subtraction result) is 
smallest is selected, and 410 data drivers each for 
reading and writing a branch destination address 
into the data latch 406. The suffixes i and j to the 
numerals 403 to 410 designate respectively* the ith 
and jth entries. Numeral 411 designates an address 
subtractor for subtracting 1 from the high-order 29 
bits of a branch instruction address. 412 an OR 
circuit for producing the logical sum of the outputs 
of the read entry preference circuits 409 of all the 
entries to generate a signal indicating that the 
branch history table 4 bits thereby outputting an 
effective branch destination address t and 417 a 
replace entry decision circuit for determining that 
entry which is written when storing a branch des- 
tination address in the data section 402. 

Fig. 4A is a detailed block diagram of the read 
entry preference circuit 409 for each entry shown 
in Fig. 3. Numeral 413 designates a subtractor for 
receiving the value of the branch position identify- 
ing latch 405 as A input and the value of the half- 
word identifying register 2 as a B input to output 
the result OUT of the subtraction A-8 and a carry 
CRY and the relations between the inputs A and B 
and the outputs OUT and CRY are as shown in Fig. 
4B. Numeral 414 is a decoder for decoding the 
difference OUT from the subtracter 413, 415 a 
logical circuit for receiving the outputs of the first 
address tag comparator 407, the second address 
tag comparator 408 and the subtracter 413 so that 
the entry where the value of the half-word identify- 
ing register 2 is greater than or equal to the value 
of the branch position identifying latch 405 is se- 
lected from those where the value of the address 
latch 403 is equal to the read address and the 
entry where the value of the half-word identifying 
register 2 is smaller than the value of the branch 
position identifying latch 405 is selected from those 
where the value of the pre-address latch 404 is 
equal to the read address, and 416 a logical circuit 
responsive to the outputs of the decoder 414 to 
select from the entries selected by the logical 
circuit 415 the one which produces the minimum 
value when the value of the half-word identifying 
register 2 is subtracted from the value of the 



branch position identifying- latch 405 (i.e., the value 
representing the unsigned 2-bit subtraction result)/ 
With the construction described above, the op- 
eration of the cache device and the instruction 
fetch device according to the first embodiment will 
now be described. 

The high-order 29 bits and the succeeding 2 
bits of an address to be read are respectively 
stored in the read pointer 1 and the half-word 
w identifying register 2 and simultaneously the in- 
struction cache 3 is searched by the value of the 
read pointer 1 obtained through a signal line 1001. 

Firtly, a description will be made of a case in 
which the address is the one which is to be read 
is for the first time. 

Since the address to be searched is the one 
which is read for the first, time, the instruction 
cache 3 does not hit. At this time, the value of the 
half word identifying register 2 is ignored and a 4- 
20 half-word instruction code separated by a 4-haif- 
word boundary is read from an external storage. At 
this time, the high-order 29 bits (the low-order 3 
bits are Os and not needed) of the read address 
and the 64-bit instruction code read are respec- . 
25 tively stored through signal lines 1003 and 1004 
into the instruction cache 3. In the instruction cache 
3, of the 29-bit address obtained from the signal 
line 1003 the low-order 7 bits are applied to the 
address decoder 304 through the selector 303 and 
30 the entry for storing them is selected. Assuming 
now that the ith entry shown in Fig. 2 is selected, 
of the outputs from the address decoder 304, the 
signal appears only on a signal line 3001 i. The 
signal on the signal line 3001 i brings the address 
os driver 31 Oi into operation so that the high-order 22 
bits of the address are inputted from the signal line 
1003 and are written into the address tetch 305i. 
Similarly, the data drivers 3121, 313i, 314i and 315i 
are brought into operation so that the first, second, 
40 third and fourth half words of the instruction code 
are inputted through the signal lines 1004 and are 
written into the data latches 306i, 307i, 308i and 
309i, respectively. Although the signal line 300 1i is 
also connected to the address drivers 31 1j. 3i2j, 
45 31 3j and 31 4j, ail of them are not operated during 
the writing. 

Paralleling the above mentioned operation, the 
pointer incremented adds 1 to the hi^h-order 29 
bits of the read address in the read pointer 1. 

50 However, if this incremented address is the one 
which is read for the first time, the instruction 
cache 3 does not hit and the previously mentioned 
storing of an instruction code in the instruction 
cache 3 is repeated. 

55 Where, the 4-haJf-word instruction code read 

from the external storage is decoded and executed 
by an external decode and execution unit so that 
the instruction code includes a branch instruction 
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and a branch is taken, independently of the stonng 
of the instruction code in the instruction cache 3, 
the high-order 31 bits of the branch instruction 
address and the high-order 31 bits of the branch 
destination address (in the both cases the lowest- 
order bit is 0 and not needed) are respectively 
stored through signal lines 1007 and 1008 in the 
branch history table 4. In the branch history table 
4 the replace entry decision circuit 417 selects the 
desired entry for storage. Assuming now that the 
ith entry shown in Fig. 3 is selected, the high-order 
29 bits of the branch instruction address are input- 
ted through the signal line 1007 and written into the 
address latch 403i and simultaneously the h.gh- 
order 29 bits are decreased by 1 and written into 
the pre-address latch 404i through the address 
subtracter 411 and a signal line 4004. At the same 
time the half-word identifying 2 bits of the branch 
instruction address are written into the branch posi- 
tion identifying latch 405i. and also the data driver 
41 Oi is operated so that the high-order 31 bits of 
the branch destination address are inputted through 
the signal line 1008 and are written into the data 
latch 406i. 

Next a description will be made of a case in 
which an instruction code has been read from the 
external storage and already stored in the instruc- 
tion cache 3 so that its read address is divided into 
the high-order 29 bits and the succeed.ng 2 bits 
and they are respectively assigned to the read 
pointer 1 and the half-word identifying register 2. In 
this case, the read address need not be the one on 
the 4-half-word boundary. 

The address to be searched is equal to the 
. address held in the address tag section 301 and 
therefore the instruction cache 3 hits. In other 
words, in the instruction cache 3 the low-order 7 
bits of the 29-bit address inputted from the read 
pointer 1 through the signal line 1 001 are applied 
to the address decoder 304 through the selector 
303 to select the entry to be read. Assuming now 
that the ith entry shown in Fig. 2 is selected, the 
address decoder 304 generates its outputs so that 
the signal appears only on the signal line 3001 1. In 
response to the signal on the signal line 3001.. the 
address drivers 31 0i and 31 Oj come into operation 
so that the values of the address latch 30Si and the 
address latch 30Sj are respectively read onto signal 
lines 3002 and 3003 and are compared with the 
high-order 22 bits of the read address from the 
read pointer 1 in the first and second address tag 
comparators 316 and 317. Since the values read 
onto the signal lines 3002 and 3003 represent the 
high-order 22 bits of the address which has been 
read in the past, the comparison exists in the both 
cases and the ANO circuit 318 generates on a 
signal line 1006 a hit signal indicating that the 
instruction codes in the data parts of the ith and ,th 



entries are contiguous. Also, while the signal on the 
signal line 3001 i brings the data drivers 31 2i to 
31 4j into operation, their operations differ depend- 
ing on the value of the half-word identifying register 

5 2 obtained through a signal line 1002. This fact will 
now be described with reference to the instruction 
alignment diagrams of Figs 5A to 5E. Shown in Fig. 
5A is the case where an instruction code OPo - 
OP? including 8 half words is present following an 

to address n on the 4-half-word boundary. 

(1) When the value of the half-word identifying 
register 2 is 0 

In response to the signal on the signal line 3001 i 
and the value of the signal line 1002. the data 
drivers 31 2i, 3131. 31 4i and 31 5i come into opera- 
tion so that the first, second, third and fourth half 
20 words OPo. OP,. OP 2 and OP 3 of the instruction 
code which are respectively held in the data 
latches 306i. 307i. 308i and 309i and aligned on the 
4-half-word boundary are outputted onto signal 
lines 3004, 3005. 3006 and 3007. thereby produc- 

25 ing the instruction code including four half words 
beginning at the first half word on the 4-half-word 
boundary indicated by the read address n stored in 
the read pointer 1 and the half-word identifying 
register 2 through signal lines 1005 as shown in 

30 Fig. 58. 

(2) When the value of the half-word identifying 
register 2 is 1 

3S In response to the signal on the signal 3001 i and 
the value of the signal line 1002. the data dnvers 
3131 31 4i. 31 5i and 31 2j come into operation so 
that the second, third and fourth half words OP,. 
OP 2 and OPa of the instruction code aligned on the 
40 4-half-word boundary which are respectively held in 
the data latches 307i. 3081 and 309i and the first 
half word OP* of the instruction code aligned on 
the upwardly contiguous 4-half-word boundary 
which is held in the data latch 306j. are respec- 
ts tively outputted onto the signal lines 3005. 3006. 
3007 and 3004, thereby producing the instruction 
code including the four half words beginning at the 
second half word on the 4-half-word boundary in- 
dicated by the read address n + 2 stored in the 
so read pointer 1 and the half word identifying register 
2 through the signal lines 1005 as shown in Fig. 
5C. 

(3) When the value of the half-word identifying 
ss register 2 is 2 

in resoonse to the signal on the signal line 3001 i 
and the value of the signal line 1002. the data 
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drivers 31 4i, 31 5i. 31 2j and 31 3j come into opera- 
tion so that the third and fourth half words OP2 and 
OP3 of the instruction code aligned on the 4-half- 
word boundary which are respectively held in the 
data latches 308i and 309i and the first and second 
half words OPa and OP3 of the instruction code 
aligned on the upwardly contiguous 4-half-word 
boundary which are respectively held in the data 
latches 306j and 307j, are respectively outputted 
onto the signal lines 3006, 3007, 3004 and 3005, 
thereby producing the instruction code including 
the four half words beginning at the third half word 
on the 4-half-word boundary indicated by the read 
address n + 4 stored in the read pointer 1 and the 
half-word identifying register 2 through the signal 
lines 1005 as shown in Fig. 50. 

(4) When the value of the half-word identifying 
register 2 is 3 

In response to the signal on the signal line 3001 i 
and the value of the signal line 1002, the data 
drivers 31 5i, 31 2j, 31 3j and 31 4j come into opera- 
tion so that the fourth half word OPa of the instruc- 
tion code aligned on the 4-half-word boundary 
which is held in the data latch 309i and the first, 
second and third half words OP*, OPs and OPs of 
the instruction code aligned on the upwardly con- 
tiguous 4-half-word boundary which are respec- 
tively held in the data latches 306j, 307j and 308j. 
are respectively outputted onto the signal lines 
3007, 3004, 3005 and 3006, thereby producing the 
instruction code including the four half words be- 
ginning at the fourth half word on the 4-half-word 
boundary indicated by the read address n + 6 
stored in the read pointer 1 and the half-word 
identifying register 2 through the signal lines 1005 
as shown in Fig. 5E. 

At this time, if a branch instruction is included 
in the instruction code outputted from the instruc- 
tion cache 3 and if its branch has been taken in the 
past, the possibility of the branch being taken again 
this time is high. For this reason, the branch history 
table 4 is searched by the value of the read pointer 
1 which is also obtained from the signal line 1001. 
This operation will be explained with reference to 
the case of the instruction alignment diagram of 
Fig. 6. The Figure shows the case in which branch 
instructions BccO, Bed, Bcc2 and Bcc3, each 
composed of a half word, are respectively arranged 
at positions n + 2. n + 6. n + 1 2 and n + 1 4 relative to 
the address n on the 4-half-word boundary. Let it 
be assumed "that these four branch instructions 
have been executed in the past so that the 
branches have been taken and the corresponding 
branch destination addresses, etc., have been 
stroed in the branch history table 4. Assuming now 
that the information relating to the branch instruc- 



tion BccO is stored in the ith entry of the branch 
history table 4, the information relating to the 
branch instruction Bed in the jth entry, the in- 
formation relating to the Bcc2 in the kth entry and 
5 the information relating to the . branch instruction 
Bcc3.in the mth entry, the high-order 29 bits or the 
addres n are. stored in each of the address latches 
403i and 403j, the high-order 29 bits of the address 
n + 8 (i.e., the high-order 29 bits of the address n 

jo plus 1) in each of the address latches 403k and 
403m, the high-order 29 bits of the address n 
minus 1 in each of the pre-address latches 404i. 
and 404j, and the high-order 29 bits of the address 
n + 8 minus 1 (i'.e., the high-order 29 bits of the 

is address n). Also, 1 is stored in the branch position 
identifying latch 4Q5i, 3 in the branch position iden- 
tifying latch 4Q5j, 2 in the branch position identify- 
ing latch 405k and 3 in the branch position identify- 
ing latch 405m. Further, the high-order 31 bits of 

20 the branch destination address of the branch in- 
struction BccO are stored in the data latch 406i, the 
high-order 31 bits of the branch destination ad- 
dress of the branch instruction Bed in the data 
latch 406j, the high-order 31 bits of the branch 

25 destination address of the branch instruction 8cc2 
in the data latch 406k and the high-order 31 bits of 
the branch destination address of the branch in- 
struction Bcc3 in the data latch 406m. Although the 
component elements with suffixes k and m are not 

30 shown in Fig. 3, they are the same "as the elements 
with the suffixes i and j. 

Assume a case in which the four half words 
following the hatched address n + 6 in Fig. 6 are 
read from the instruction cache 3. in this case, the 

35 value of the read pointer 1 represents the high- 
order 29 bits of the address n and the value of the 
half-word identifying register 2 is 3. In the first 
address tag comparators 407, the equality between 
the high-order 29 bits of the address n obtained 

40 from the signal line 1001 and the value of the 
address latch 403 is detected so that the first 
address tag comparator 407i of the ith entry gen- 
erates a signal on a signal line 4001 i and the first 
address tag comparator 407j of the jth entry gen- 

45 erates a signal on a signal line 4001 j. Also, in the 
second address tag comparators 408. the equality 
between the high-order 29 bits of the address n 
obtained from the signal line 1001 and the value of 
the pre-address latch 404 is detected so that the 

so second address tag comparator 408k of the kth 
entry generates a signal on a signal line 4002k and 
the second address tag comparator 408m gen- 
erates a signal on a signal line 4002m. In response 
to the signals on the signal lines 4001 and 4002, 

55 the read entry preference circuits 409 of the re- 
spective entries operate in the following manner. 
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(1) The read entry preference circuit 409i of the ith 
entry 

The subtracter 41 3i subtracts the value or the half- 
word identifying register 2 from the value of the 
branch position identifying latch 405i so that the 
carry CRY becomes 0 and the logical circuit 415i 
generates a logical 0. In response to this output, a 
logical 1 is generated from the gate of each of the 
three open collectors of the logical circuit 41 6i and 
therefore signal lines 4006, 4007 and 4008 are not 
caused to go to a low potential. Also, a logical 0 is 
generated on the signal line 4003i. The foregoing is 
due to the fact that the branch instruction BccO is 
not included in the four half words read from the 
instruction cache 3. 

(2) The read entry preference circuit 409j of the jth 
entry 

The subtrator 41 3j subtracts the value of the half- 
word identifying register 2 from the value of the 
branch position identifying latch 405j so that the 
carry CRY becomes 1 and the logical circuit 41 5j 
generates a logical 1 in response to the carry CRY 
and the logical 1 appearing on the signal line 
4001 j. Also, the value of the difference OUT be- 
comes 0 and the decoder 41 4j changes its output 
lines at 0 to a logical 1. In response to these 
inputs, the logical circuit 41 6j generates a logical 0 
at the gate of the highest-stage open collector so 
that the signal line 4006 goes to the low potential 
and a logical 1 is generated on the signal line 
4003j. The foregoing operation is due to the fact 
that the branch instruction Bed is included in the 
four half words read from the instruction cache 3 
and that the address of the branch instruction Bed 
is nearest to the read address. 

(3) The read entry preference circuit 409k of the 
kth entry 

The subtracter 41 3k subtracts the value of the half- 
word identifying register 2 from the branch position 
identifying latch 405k so that the carry CRY be- 
comes 0 and the logical circuit 41 5k generates a 
logical 1 in response to the carry CRY and the 
logical 1 produced on the signal line 4002k. AJso, 
the value of the difference OUT becomes 3 and the 
decoder 41 4 changes its output line indicating 3 to 
a logical 1. However, since the signal line 4006 is 
at the low potential or the logical 0, the logical 
circuit 416k cannot generate a logical 0 at any of 
the open collector gates and a logical 0 is gen- 
erated on the signal line 4003k. The foregoing 
operation is due to the fact that while the branch 
instruction 8cc2 is included in th four half words 
read from the instruction cache 3. there exists 



another branch instruction at the closer address io 
the read address. 

(4) The read entry preference circuit 409m of the 
5 mth entry 

The subtracter 413m subtracts the value of the 
half-word identifying register 2 from the value of 
the branch position identifying latch 405m so that 

io the carry CRY becomes 1 and the logical circuit 
415m generates a logical 0. In response to the 
logical 0, the logical circuit 416m generates a logi- 
cal 1 at the gate of each of its three open collec- 
tors and the signal lines 4006, 4007 and 4008 are 

;s prevented from going to the low potential. Also, a 
logical 0 is generated on the signal line 4003m. 
The foregoing operation is due to the fact that the 
branch instruction Bcc3 is not included in the four 
half words read from the instruction cache 3. 

20 In response to the read entry preference circuit 

409j through the signal line 4003), the data driver 
41 Oj comes into operation and the high-order 31 
bits of the branch destination address of the branch 
instruction Bed, which is held in the data latch 

25 406j. are outputted onto a signal line 1009. Also, in 
response to the signal line 4003j, the logical circuit 
412 outputs a hit signal on a signal line 1010. 

In response to the hit signal on the signal line 
1010, the pointer input selector 6 switches the 

ao input of the read point 1 from the output of the 
pointer incrementer 5 to the high-order 29 bits of 
the branch destination address of the branch in- 
struction Bccl received from the branch history 
table 4 through the signal line 1009 and simulta- 

05 neously the low-order 2 bits from the signal line 
1009 or the half-word identifying 2 bits of the 
branch destination address of the branch instruc- 
tion Bed are stored in the half-word identifying 
register 2. Then, in accordance with the values of 

40 the read pointer. 1 and the half-word identifying 
register 2. the instruction cache 3 similarly reads 
the four haJf words indicated by the branch destina- 
tion address of the branch instruction Bed. 

From the foregoing it will be seen that in accor- 

4s dance with the present embodiment the instruction 
cache is realized in which the address decoder 304 
selects two of the entries so that the address tags 
of these entries are respectively compared in the 
first and second address tag comparators 316 and 

50 317 to verify that the instruction codes stored in the 
two entries are contiguous and also the data dri- 
vers 312 to 315 are controlled in accordance with 
the value of the half-word identifying register 2. 
thereby reading the instruction code of 4 half words 

55 long, which is indicated by any given read address, 
from the two entries Irrespective of the instruction 
alignment. 

Further, in accordance with the present em- 
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bodiment the branch history table is realized in 
which each of the entries is provided with the pre- 
address latch 404 for storing the address on ihe 4- 
half-word boundary preceding the address on the 
4-half-word boundary of a branch instruction and 
the second address tag comparator 408 for detect- 
ing the presence of equality between the stored 
address and the read address thereby extracting all 
the entries storing the branch destination address- 
es of the branch instructions present in the 8 half 
words aligned" on the 4-half-word boundaries com- 
pletely containing each 4 half words read from the 
instruction cache 3, and the necessary selection is 
effected by the read entry preference circuit 409 in 
accordance with the values of the half-word iden- 
tifying register 2 and the half-word identifying 2 bits 
of each of the branch instruction addresses, thus 
reading the branch destination address of the 
branch instruction arranged in the nearest position 
to the read address within the 4 half words begin- 
ning at the read address. 

Still further, in accordance with the present 
embodiment the instruction fetch device is realized 
in which the instruction cache 3 and the branch 
history table 4 are searched by the read address 
so that when the branch history table 4 hits, the 
instruction cache 3 and the branch history table 4 
are searched again by the branch destination ad- 
dress obtained from the branch history table 4 with 
the result that the desired instruction code of 4 half 
words long can always be read irrespective of the 
instruction alignment and that only the prediction of 
the branch of that branch instruction which is to be 
executed first is performed irrespective of the posi- 
tion of the branch instruction in the instruction code 
read and irrespective of the number of branch 
instructions included in the instruction code. 

Referring to Fig. 7, there is illustrated a de- 
tailed block diagram of the instruction cache 3 in a 
cache device and an instruction fetch device for 
supplying a fixed word length of a variable length 
instruction code according to a second embodi- 
ment of the present invention. The remaining com- 
ponent parts are the same as in the first embodi- 
ment shown in Fig. 1. In Fig. 7, numeral 321 
designates an address tag section for storing the 
high-order 29 bits of each address and effecting a 
search by the value of the read pointer 1, 322 a 
data section for storing each 64-bit instruction code 
aligned on a 4-haif-word boundary and selecting 
the instruction code of the entry searched by the 
address tag section 321 by the value of the half- 
word identifying register 2 to output the instruction 
code of 4 half words long, 323 data latches each 
for holding the high-order 29 bits of an address, 
324 pre-address latches each for holding a value 
obtained by subtracting 1 from the high-order 29 
bits of an address, 325, 326, 327 and 328 data 



latches for respectively ho/ding the first, second, 
third and fourth half words of an instruction code 
aligned on a 4-half-word boundary, 329 first ad- 
dress tag comparators each for detecting the pres- 
5 ence of equality between the value of the address 
latch 323 and the read address held in the read 
pointer 1, 330 second address tag comparators 
each for detecting the presence cf equality be- 
tween the value of the pre-address latch 324 and 
ro the read address held in the read pointer 1, 331 
data drivers each for writing the first half word in 
the data latch 325 of the entry into which the 
instruction code is written, reading the data latch 
325 of the entry for which the equality is detected 

/5 by the first address tag comparator 329 when the 
value of the half-word identifying register 2 is 0 and 
reading the data latch 325 of the entry for which 
the equality is detected by the second address tag 
comparator 330 when the value of the half-word 

20 identifying register 2 is greater than 1, 332 data 
drivers each for writing the second half word in the 
data latch 326 of the entry into which the instruc- 
tion code is written, reading the data latch 326 of 
the entry for which the equality is detected by the 

25 first address tag comparator 329 when the value of 
the half-word identifying register 2 is less, than 1 
and reading the data latch 326 of the entry for 
which the equality is detected by the second ad- 
dress tag comparator 330 when the value of the 

30 half-word identifying register 2 is greater than 2, 
333 data drivers each for writing the third half word 
in the data latch 327 of the entry into which the. 
instruction code is written, reading the data latch 
327 of the entry for which the equality is detected 

35 by the first address tag comparator 329 when the 
value of the half-word identifying register 2 is less 
than 2 and reading the data latch 327 of the entry 
for which the equality is detected by the second 
address tag comparator 330 when the value of the 

40 half-word identifying register 2 is 3, and 334 data 
drivers each for writing the fourth half word in the 
data latch 328 of the entry into which the instruc- 
tion code is written and reading the data latch 328 
of the entry for which the equality is detected by 

<*s the first address tag comparator 329. The suffixes i 
and j of the numerals 323 to 334 designate respec- 
tively the ith and jth entries. Numeral 335 des- 
ignates an address subtracter for decreasing the 
high-order 29 bits of an address by 1. 336 a first 

so OR circuit for producing the logical sum of the 
outputs from ail the first address tag comparators 
329. 337 a second OR circuit for producing the 
logical sum of the outputs from all of the second 
address tag comparators 330, 338 an ANO circuit 

55 for producing the logical product of the outputs 
from the first and second OR circuits 336 and 337 
to generate a signal indicating that the instruction 
codes of the entries for which the equality is de- 
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tected by the first and second address tag com- 
parators 329 and 330. respectively, are contiguous, 
and 339- a replace entry decision circuit for deter- 
mining the entry to be written when storing the 
instruction code in the data section 322. 

With the construction described above, the op- 
eration of the cache device and the instruction 
fetch device according to the second embodiment 
will now be described. In this case, however, a 
description will be made only of the writing and 
reading operations of the instruction cache 3 which 
are different from the first embodiment. 



<The writing in the instruction cache 3> 

Ignoring the value of the half-word identifying 
register 2. an instruction code of 4 half words long, 
which is separated by a 4-half-word boundary, is 
read from an external storage and at this t.me the 
high-order 29 bits of the. read address (the low- 
order 3 bits are 0s and are not needed) the 64-bit 
instruction code read are respectively stored 
through signal lines 1003 and 1004 in the instruc- 
tion cache 3. In the instruction cache 3. the entry 
for storage is selected by the replace entry de- 
cision circuit 339. Assuming now that the ith entry 
shown in Fig. 7 is selected, the high-order 29 bits 
of the address are written into the address latch 
323i through the signal line 1003 and also the 
address subtracter 335 decreases the same 29 bits 
by 1 and writes them in the pre-address latch 324. 
through a signal line 3013. Similarly, the data dn- 
vers 331 i. 332i. 333i and 3341 come into operation 
so that the first, second, third and fourth half words 
! of the instruction code are inputted through the 
signal lines 1004 and written into the data latches 
325i. 326i. 327i and 328i, respectively. 

<The reading from the instruction cache 3> 

In this case, the read address need not be the 
one on the 4-half-word boundary. For purposes of 
simplicity, a description will be made by referring 
again to the case of the instruction alignment dia- 
grams, of. Figs. 5A to 5E. Assuming now that the 4- 
half-word instruction code having the address n on 
the 4-half-word boundary is stored in the ith entry 
and the 4-half-word instruction code of the address 
n + 8 on the next 4-half-word boundary is stored in 
the jth entry, the high-order 29 bits of the address 
n is stored in the address latch 323i. the high-order 
29 bits of the address n+8 (i.e.. the: high-order 29 
bits of the address n plus 1) in the address latch 
323j. the high-order 29 bits of the address n minus 
1 in the pre-address latch 324i. the high-order 29 
bits of the address n*8 minus 1 (i.e.. the h.gh- 



order 29 bits of the addres n) in the pre-address 
latch 324j, the instruction code OPa - OP 3 in the 
data latches 325i to 328i and the instruction code 
OPi - OP? in the data latch 325j to 328j. Assume a 
s case in which this instruction cache 3 is read by 
the read address which is any one of the ad- 
dresses n, n + 2. n + 4 and n + 6. At this time, the 
value of the read pointer 1 represents the high- 
order 29 bits of the address n and the value of the 
,0 half-word identifying register 2 is 0. 1 . 2 or 3. In the 
first address tag comparators 329. the presence of 
equality between the high-order 29 bits of the ad- 
dress n received from the signal line 1001 and the 
value of the address latches 323 is detected and 
, 5 the first address tag comparator 329i of the ith 
entry generates a signal on a signal line 301 1i. v 
Also, in the second address tag comparators 330. 
the presence of equality between the high-order 29 
bits of the address n received from the signal line 
20 1001 and the value of the pre-address latches 324 
is detected and the second address tag comparator 
330j of the ith entry generates a signal or a signal 
line 3012]. In response to the signals on the signal 
lines 30111 and 301 2j. the data drivers 331 i to 334) 
2S come into operation and their operations differ .n 
dependence on the value of the half-word identify- 
ing register 2 obtained from the signal line 1002. 

30 (1) When the value of the half-word identifying 
register 2 is 0 

In response to the signal on the signal line 301 1i 
and the value of the signal line 1002. the data 
3S drivers 331 i. 332i. 333i and 334i come into opera- 
tion so that of the instruction code aligned on the 
4-half-word boundary, the first half word OPo. the 
second half word OP,, the third half word OP 2 and 
the fourth half word OP 3 . which ^e respectively 
<o held in the data latches 32Si. 326i. 3271 and 328k 
are outputted on signal lines 3014. 3015. 3016 and 
3017. thereby producing the instruction code in- 
cluding the 4 half words beginning at the first han 
word on the 4-half-word boundary indicated by me 
4s read address n stored in the read pointer 1 and the 
half-word identifying register 2 through the signal 
lines 1005 as shown in fig. 58. 

(2) When the value of the half-word identifying 
50 register 2 is 1 

The data drivers 332i. 333i and 334i come into 
operation in response to the signal on the signal 
line 30115 and the value of the signal line 1002 and 
55 the data driver 331] comes into operation in re- 
sponse to the signal on the signal line 3012, so that 
of the instruction code aligned on the ^.f-w°rd 
boundary the second half word OP,, the third half 
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word OP 2 and the fourth half word OPi, which are 
respectively held in the data latches 326i, 327i and 
328i, and the first half word OPa of the instruction 
code aligned on . the upwardly contiguous 4-haif- 
word boundary, which is held in the data latch 325j, 
are respectively outputted on the signal lines 3015, 
3016, 3017 and 3014, thereby producing the in- 
struction code including the four half words begin- 
ning at the second half word on the 4-haif-word 
boundary indicated by the read address n + 2 
stored in the read pointer 1 and the half-word 
identifying register 2 through the signal lines 1005 
as shown in Fig. 5C. 

(3) When the value of the half-word identifying 
register 2 is 2 

The data drivers 333i and 334i come into operation 
in response to the signal on the signal line 301 1 i 
and the value of the signal line 1002 and the data 
drivers 331 j and 332j come into operation in re- 
sponse to the signal on the signal line 301 2j and 
the value of the signal line 1002 so that the third 
half word OP 2 and the fourth half word OP 3 of the 
instruction code aligned on the 4-half-word bound- 
ary, which are respectively held in the data latches 
327i and 328i, and the first half word OP* and the 
second half word OP s of the instruction code 
aligned on the upwardly contiguous 4-half-word 
boundary, which are respectively held in the data 
latches 325j and 326j, are respectively outputted 
on the signal lines 3016, 3017, 3014 and 3015, 
thereby producing the instruction code including 
the 4 half words beginning at the third half word on 
the 4-half-word boundary indicated by the read 
address n + 4 stored in the read pointer 1 and the 
half-word identifying register 2 through the signal 
lines 1005 as shown-in Fig. 50. • 

(4) When the value of the half word identifying 
register 2 is 3 

The data driver 334i comes into operation in re- 
sponse to the signal on the signal line 301 1 i and 
the value of the signal line 1002 and the data 
drivers 331 j t 332j and 333j come into operation in 
response to the signal on the signal line 301 2j and 
the value of the signal line 1002 so that the fourth 
half word OP3 of the instruction code aligned on 
the 4-half-word boundary, which is held in the data 
latch 328i, and the first half word OP*, the second 
half word OPs and the third half word OPs of the 
instruction code aligned on the upwardly contig- 
uous 4-half-word boundary, which are respectively 
held in the data latches 325j, 326j and 327j, are 
respectively outputted on the signal lines 3017, 
3014, 3015 and 3016, thereby producing the in- 
struction code including the four half words begin- 



ning at the fourth half word on the 4-half-word 
boundary indicated by the read address n + 6 
stored in the read pointer 1 and the half-word 
identifying register 2 through the signal lines 1005 

5 as shown in Fig. 5E. 

Also, in response to the signals from the signal 
lines 301 1 i and 3012], each of the OR circuits 326 
and 337 generates a logical 1 signal and the AND 
circuit 338 generates a signal indicating that the ith 

10 and jth entries hit and their instruction codes are 
contiguous. 

From the foregoing description it will be seen 
that in accordance with the second embodiment 
the instruction cache is realized in which each 
75 entry is provided with the pre-address latch 324 for 
storing the address on the 4-half-word boundary 
preceding the address on the 4-half-word boundary 
of an instruction code and the second address tag ' 
comparator 330 for detecting the presence of 

20 equality between the value of the former and the 
read address whereby two of the entries are ex- 
tracted which include the address on the 4-half- 
word boundary held in the read pointer and the 
address on the next 4-half-word boundary in the 

25 address tag section 321, and the data drivers 331 
to 334 are controlled in accordance with the value 
of the half-word identifying register 2, thereby read- 
ing the instruction code of a 4-half-word word 
length by the given read address from these two 

00 entries irrespective of the instruction alignment. 

It is to be noted that while, in the first embodi- 
ment, during the reading of the instruction cache 3 
it is verified that the instruction codes in the data 
sections of the succeeding two entries selected by 

3S the first address tag comparator 316 and the sec- 
ond address tag comparator 317 are the contig- 
uous codes, it is possible to arrange so that in the 
instruction cache 3 each of the entries is provided 
with a flag indicating that the instruction code of 

40 this entry is contiguous to the instruction code of 
the preceding entry and these flags are set when 
storing the contiguous instruction codes in the suc- 
ceeding entries, thereby allowing to verify during 
reading that the instruction codes in the data sec- 

45 tions of the selected succeeding two entries are 
contiguous by means of the flags. 

Also, while, in the first embodiment, there are 
provided the data drivers 312 to 314 which are 
controlled in response to the output lines of the 

50 address decoder 304 and the value of the half-word 
identifying register 2 and the data drivers 315 
which are operated in response to only the outputs 
of the address decoder 304 whereby the desired 4 
half words are selected from the 8 half words 

55 stored in the data latches 306 to 309 of the two 
entries selected by the address decoder 304 and 
are outputted. it is possible to arrange so that there 
are provided data drivers which are operated in 

12 
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response to only the outputs of the address de- 
coder 304 whereby the 8 half words stored in the 
data latches 306 to 309 of the two entries selected 
by the address decoder 304 or the 7 half words 
excluding the last half word are outputted and the 
desired 4 half words are selected from these 8 or 7 
half words in accordance with the value of the half- 
word identifying register 2 externally of the instruc- 
tion cache 3. 

Further, while, in the first embodiment, the 
number of entries in the instruction cache 3 is set 
to 128 the embodiment can be worked out by 
using any arbitrary number of entries. If the num- 
ber of entries is selected the nth power of 2 (in this 
embodiment n = 7). it is only necessary to select 
so that the number of bits is n in the selector 303 
and the address decoder 304 and the number of 
bits is 29-n in all the component elements of the 
address tag section 301 and the first and second 
address tag comparators 316 and 31 7. 

Still further, while, in- the first or second em- 
bodiment, the 4 half words aligned on a 4-half-word 
boundary are stored in each entry of the data 
section 302 or 322 in the instruction cache 3 to 
select the desired 4 half words from the two entr.es 
holding the contiguous instruction codes, it is pos- 
sible to store only 2 half words or single half word 
aligned on a 2-half-word boundary in each entry of 
the data section thereby selecting the des.red I 4 
hart words from the 3 or 4 entries hold.ng the 
contiguous instruction codes or conversely to store 
an instruction code of more than 8 half words 
aligned on a 8-half-word boundary in each entry ot 
the data section thereby selecting the desired 4 
half words from the two entries holding the contig- 
uous instruction codes. 

Still further, while, in the first or second em- 
bodiment, the branch history table 4 includes the 
pre-address latches 404 and the address subtracter 
41 1 whereby during the writing of a branch destina- 
tion address in the branch history table 4 both the 
high-order 29 bits of the branch instruction address 
and the value representing the former minus 1 are 
stored in the address tag section 401 thereby de- 
tecting the entry in which the high-order 29 bits of 
the branch instruction address hits either of these 
two values during the searching of the branch 
history table 4. it is possible to provide^ an address 
adder which adds 1 to the high-order 29 bits of the 
read address so that only th high-order 29 bits of 
the address are stored in the address tag section 
401 thereby detecting that entry in which the value 
of the address tag section 401 hits the high-order 
29 bits of the read address or the same value but 
increased by 1 during the search of the branch 
history table 4. 

Still further, while, in the first or second em- 
bodiment, each of the read entry preference c.r- 



cuits 409 includes the subtracter 413. the decoder 
415 and the logical circuits 415 and 416 as sepa- 
rate elements, all of them can be combined in one 
to perform the same functions for the purpose of 
s simolifying the circuitry. 

Still further, while, in the second embodiment, 
the instruction cache 3 includes the pre-address 
latches 324 and the address subtracter 335 where- 
by during writing of the instruction code in the 
,o instruction cache 3 both the high-order 29 bits of 
the address and the same value but decreased by 
1 are stored in the address tag section 321 so as 
to search that entry in which the high-order 29 bits 
of the read address hits either of these values 
,s during reading of the instruction code from the 
instruction cache 3. it is possible to provide an 
address adder which adds 1 to the high-order 29 
bits of the read address so that only the high-order 
29 bits of the address are stored in the address tag 
20 section 321 thereby searching that entry in which 
the value of the address tag section 321 hits the 
high-order 29 bits of the read address or the same 
value but increased by 1 during reading of the 
instruction code from the instruction cache 3. 
25 Still further, while, in the second embodiment, 

the instruction cache 3 includes the data drivers 
331 to 333 which are controlled in response to the 
output lines of the first and second address tag 
comparators 329 and 330 and the value of the half- 
30 word identifying register 2 and the data drivers 334 
which are operated only in response to the outputs 
of the first address tag comparators 329 whereby 
the desired 4 half words are selected from the 8 
half words stored in the data latches 325 to 328 of 
3S the two entries selected by the first and second 
address tag comparators 329 and 330 and are 
outputted. the instruction cache 3 may be provided 
with only data drivers which are operated in re- 
sponse to the outputs of the first or second ad- 
<o dress tag comparators 329 or 330 "hereby the 8 
half words stored in the data latches 325 to 328 of 
the selected two entries or the 7 half words exclud- 
ing the last half word are outputted and the desired 
4 half words are selected from the 8 half words or 
« the 7 half words in accordance with the value of the 
half-word identifying register 2 externally of the 
instruction cache 3. • 

Still further, while the instruction cache 3 >s 
formed by the direct mapping method in the first 
so embodiment and by the full associative method in 
the second embodiment, it may be formed by the 
set associative method. 

From the foregoing description it will be seen 
that in accordance with the first aspect of *e 
S5 present invention the storage contents of the de- 
sired word length can be read by any arbrfrary 
read address irrespective of the alignment of the 
storage contents on the main storage. 
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In accordance with the second aspect of the 
present invention, it is possible to read the storage' 
contents forming a set with identifying information 
nearest to identifying information for identifying the 
subject storage contents of a read operation and 
falling within a predetermined range. 

In accordance with the third aspect of the in- 
vention, an instruction string of a predetermined 
word length can always be read out and it is 
possible to effect only the prediction of a branch of 
the branch instruction which must be executed first 
irrespective of the position of the branch instruction 
in the instruction string read or irrespective of the 
number of the branch instructions included. 

In accordance with the first and second as- 
pects of the present invention the construction 
comprises regular repetitions and therefore even its 
incorporation in a semiconductor integrated circuit 
can be realized without any decrease in packing 
density and also the increase in the design period 
can be minimized. 

It will thus be seen that the present invention 
has a very great practical effect. 

Claims 

1. A cache device comprising: 

storage means (301, 302) including a plurality of 
entries each storing a part of a main storage repre- 
senting a unit of storage capacity of at least one 
word which is identified by an address and said 
address or a part thereof as a set; 
searching means (304) for selecting specified one 
of said entries whose stored address or a part 
thereof is coincident with a read address and at 
least one another of said entries holding storage 
contents contiguous to storage contents stored in 
said specified one entry during a reading operation 
of said storage means; and 

storage contents selecting means (312-315) where- 
by the storage contents stored in said at least one 
or more entries selected by said searching means 
are selected by every storage unit stored in said 
selected at least one or more entries in correspon- 
dence to said desired read address to extract said 
storage contents of a desired word length/thereby 
reading the storage contents of said desired word 
length from said at least one or more entries. 

2. A cache device comprising: 

storage means (301, 302) including a plurality of 
entries for storing' an instruction string divided into 
minimum units of instruction and including a given 
number of words and a high-order part of a cor- 
responding address on a main storage storing said 
instruction string as a set in each one of said 
entries determined by a tow-order part of said 
address: 



searching means (304) whereby during an instruc- 
tion string reading operation a low-order part of a 
read address is decoded such that of one of said 
entries determined by a result of said decoding 

5 and at least one another of said entries succeeding 
said one entry those in each of which the high- 
order part of the address stored therein coincides 
with the high-order, part of said read address are 
selected from said storage means: and 

jo storage contents selecting means (312-315) where- 
by a set of instruction strings having contiguous 
addresses and stored in said at least, one or more 
entries selected by said searching means are se- 
lected and extracted by every minimum unit of 

75 instruction in accordance with said desired read 
address, thereby reading the instruction string of a 
desired word length from at least one or more of 
said entries. 

3. A cache device comprising:. 

20 storage means (321, 322) including a plurality of 
entries each storing an instruction string divided 
into minimum units of instruction and including a 
given number of words, a corresponding address 
on a main storage storing said instruction string 

25 and a value obtained by subtracting a given value 
from said address as a set; 

searching means (329, 330) for selecting from said 
storage means specified ones of said entries in 
which the stored address or a value obtained by 
30 decreasing a given value from said address co- 
incides with a read address during an instruction 
string reading operation; and 

storage contents selecting means (331-334) where- 
by a set of instruction strings having contiguous 

35 addresses and stored in said at least one or more 
entries selected by said searching means are se- 
lected and extracted by every minimum unit of 
instruction in accordance with said desired read 
address, thereby reading the instruction string of a 

40 desired word length from said at least one or more 
entries. 

4. A cache device comprising: 

storage means (401, 402) including a plurality of 
entries each storing storage contents and identtfy- 

45 ing information for identifying said storage contents 
as a set- 
searching means (407, 408) for selecting from said 
storage means specified one of said entries whose 
stored identifying information coincides with iden- 

50 tifying information for identifying subject storage 
contents of a reading operation on said storage 
means and at least one another of said entries 
contained within a given range determined by said 
identifying information stored in said specified one 

55 entry; and 

storage contents selecting means (409) whereby of 
said at least one or more entries selected by said 
searching means that entry stored identifying in- 
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formation has a minimum difference from sa.d 
identifying information for identifying sa.d suo.ect 
storage contents of said reading operation . se- 
lected to output the storage contents stored .here- 
in, thereby reading from said at least one or more 
entries said storage contents associated as a set 
with said identifying information nearest to said 
identifying information for identifying the suo.ect 
storage contents of the reading operation ano fall- 
ing within a predetermined range. 

5. A cache device comprising: 
storage means (401. 402) including a plurality of 
entries each storing a branch destination address 
of a branch instruction, a corresponding address on 
a main storage storing said branch instruction and 
a value obtained by decreasing a given value from 
said address as a set; 

first searching means (407) for selecting from M»d 
storage means specified ones of said entries 
whose stored addresses coincide with a read ad- 
dress during an instruction string reading operation: 
first storage contents selecting means (409) for 
selecting from said entries selected by sa.d first 
searching means one which produces a zero or 
smallest positive difference when sa.d instn^on 
read address is subtracted from the branch instruc- 
tion address stored therein and outputting the 
branch destination address stored in sa.d one en- 

slcond searching means for selecting from .said 
storage means particular ones of sa.d plurality of 
entries each storing the va.ue obtained by.decreas- 
ing a given va.ue from the address wh.ch co.nades 
with said read address during an instructs stnng 
reading operation; and 
• second storage contents selecting mean (409) 
whereby when said first storage conten » 
means produces no output, one 
selected by said second searching means is se 
,ected which produces a negative difference of a 
smallest absolute va.ue when said instructor .read 
address is subtracted from the va.ue obtame I by 
subtracting a given value from the branch ^instruc- 
tion address stored therein, thereby read. ng *e 
branch destination of said branch instruction near- 
est to said instruction read address and fallmg 
within a predetermined range from said at least one 

or more entries. . . 

6. An instruction fetch device compns.ng: 
read address holding means (1) for hold.n? | an 
address on a main storage storing an instruction 
string subject to reading; 

instruction storage means (3) inc.ud.nc , storage 
means (301. 302) including a plurality 
each storing a part of a main storage ^resentmg 
a unit of storage capacity of at least one .word and 
Tdentifiable by an address and -id address or 
part thereof as a set. search.ng means (304) for 



selecting from said storage means specified one of 
said entries whose stored address or a part thereof 
coincides with a read address and at least one 
another of said entires holding storage contents 
s contiguous to storage contents stored in sa.d 
specified one entry during a read operation or sa.d 
storage means, and storage contents selecting 
means (312-315) whereby the storage contents 
stored in said at least one or more entries selected 
,o by said searching means are selected by every 
storage unit stored in said selected entries in cor- 
respondence to said desired read address, thereby 
extracting said storage contents having a des.red 
word length; 

, 5 branch predicting means (4) includ.ng storage 
means (401, 402) including a plurality of entries 
each storing storage contents and identifying in- 
formation for identifying said storage contents as a 
set searching means (407, 408) for selecting from 
20 said storage means specified one of said I entr.es 
whose stored identifying information co.nades w.th 
identifying information for identifying subiect stor- 
age contents of a reading operation on sa.d storage 
means and at least one another of said entr.es 
2S contained within a given range determined by sa.d 
identifying information stored in said spec.fied one 
entry, and storage contents selecting means (409) 
whereby of said at least one or more entries se- 
lected by said searching means that entry whose 
30 stored identifying information has a m.n.mum de- 
ference from said identifying information for .den- 
tifying said subject storage contents of said read.ng 
operation is selected to output the storage contents 
stored therein; and 
35 address incrementing means (5) for mcreas.ng a 
value of said read address holding means by a 
given value determined by the number of words in 
said instruction string read from sa.d instruction 
storage means when said branch pred.ct.ng means 
40 outputs no branch destination address. 

whereby an instruction string of a predeterm ned 
number of words is a.ways read from 
ion storage means and also on.y the pred-ction of 
a branch of that branch instruction wh.ch « i to*e 
. s executed first is performed irrespective or the pos - 
tion and number of branch instructions .nc.uded .n 
an instruction string read. 

7. An instruction fetch device ^P™"* 
read address holding means (1) for hold ng an 
50 address on a main storage storing an .nstruction 
string subject to reading; ^ or aae 
instruction storage means (3) «^JBJW 
means (301. 302) including a plural.ty of entr.es or 
ZZg an instruction string divided into m.n.mum 
S5 units of instruction and including a given number o, 
words and a high-order part of the correspond.ng 
address on said main storage stonng sa.d mstruc 
tion string as a set in each of said entries deter- 



15 



EP 0 372 865 A2 



23 



mined by a low-order part of said address, search- 
ing means (304) whereby during an instruction 
string reading operation a low-order part of a read 
address is decoded such that of one of said entries 
determined by a result of said decoding and at 5 
least one another of said entries succeeding said 
one entry those in each of which the high-order 
part of the address stored therein coincides with 
the high-order part of said read address is se- 
lected, and storage contents selecting means (312- jo 
315) whereby a set of instruction strings having 
contiguous addresses and stored in said at least 
one or more entries selected by said searching 
means are selected and extracted by minimum unit 
of instruction in accordance with said desired read is 
address or instruction storage means (3) including 
storage means (321, 322) including a plurality of 
entries each storing an instruction string divided 
into minimum units of instruction and including a 
given number of words, a corresponding address 20 
on said main storage storing said instruction string 
and a value obtained by subtracting a given value 
from said address as a set, searching means (329, 
330) for selecting from said storage means speci- 
fied ones of said entries in which the sotred ad- 25 
dress or the value obtained by decreasing a given 
value therefrom coincides with a read address dur- 
ing an instruction string reading operation, and 
storage contents selecting means (331-334) where- 
by a set of instruction strings having contiguous 30 
addresses and stored in said at least one or more 
entries selected by said searching means are se- 
lected and extracted by every minimum unit of 
instruction in accordance with said desired read 
address; os 
branch predicting means (4) including storage 
means (401, 402) including a plurality of entries 
each storing a branch destination address of a 
branch instruction, a corresponding address on 
said main storage storing said branch instruction 40 
and a value obtained by subtracting a given value 
from said address as a set, first searching means 
(407) for selecting from said storage means speci- 
fied ones of said entries whose stored addresses 
coincide with said read address during an instruc- 45 
tion string reading operation, first storage contents 
selecting means (409) for selecting from said en- 
tries selected by said first searching means one 
which produces ■ a zero or smallest positive dif- 
ference when said instruction read address is sub- so 
tracted from the branch instruction address stored 
therein and outputting the branch destination ad- 
dress stored in said one entry, second searching 
means (408) for selecting from said storage means 
particular ones of said plurality of entries each 55 
storing the value obtained by decreasing a given 
value from the address which coincides with said 
read address during an instruction siring reading 



operation, and second stroage contents selecting 
means (409) whereby when said first storage con- 
tents selecting means produces no output, one of 
said entries selected by said second searching 
means is selected which produces a negative dif- 
ference of a minimum absolute value when said 
instruction read address is subtracted from the 
value obtained by subtracting a given value from 
the branch instruction address stored therein to 
output the branch destination address stored there- 
in; and 

address incrementing means (5) for increasing a 
value of said read address holding means (1) by a 
given value determined by the number of words in 
the instruction string read from said instruction 
storage means when said branch predicting means 
generates no output, 

whereby an instruction string of a predetermined 
number of words is always read from said instruc- 
tion storage means and only the prediction of a 
branch of that branch instruction which is to be 
executed first is performed irrespective of the posi- 
tion and number of branch instructions included in 
an instruction string read. 
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